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ENTERFn 

SEQUENCE LISTING ' 1 * LJ 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: TAKAKURA, Hikaru 

6 MORISHITA, Mio 

7 YAMAMOTO, Katsuhilco 

8 MITTA, Masanori 

9 AS ADA, Kiyozo 

10 TSUNASAWA, Susumu 

11 KATO, Ikunoshin 

13 (ii) TITLE OF INVENTION: HJTE^THE^MOSTABLE PROTEASE GENES 

15 (iii) NUMBER OF SEQUENCES : \42 

17 (iv) CORRESPONDENCE ADDRESS^ 

18 (A) ADDRESSEE: Browdy and Neimark 

19 (B) STREET: 419 Seventh Street N.W., Ste. 300 

20 (C) CITY: Washington 

21 (D) STATE: D.C. 

22 (E) COUNTRY: United States of America 

23 (F) ZIP: 20004 

25 (v) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

27 (B) COMPUTER: IBM PC compatible 

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 
" 29 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

31 (vi) CURRENT APPLICATION DATA: 

C~> 32 (A) APPLICATION NUMBER: US/09/841,553 

C— > 33 (B) FILING DATE: 24-Apr-2001 

c __> 39 (vii) PRIOR APPLICATION DATA: 

36 (A) APPLICATION NUMBER: US 08/894,818 

37 (B) FILING DATE: 

40 (A) APPLICATION NUMBER: JP 323285/1995 

41 (B) FILING DATE: 12-DEC-1995 

4 3 (viii) ATTORNEY/AGENT INFORMATION: 

44 (A)- NAME: Browdy, Roger L. 

45 (B) REGISTRATION NUMBER: 25,618 

46 (C) REFERENCE/DOCKET NUMBER: TAKAKURA=1 

48 (ix) TELECOMMUNICATION INFORMATION: 

49 (A) TELEPHONE: (202) 628-5197 

50 (B) TELEFAX: (202) 737-3528 
53 (2) INFORMATION FOR SEQ ID NO: 1: 
55 (i) SEQUENCE CHARACTERISTICS: 
56* (A) LENGTH: 659 amino acids 

57 (B) TYPE: amino acid 

58 (C) STRANDEDNESS: single 

59 (D) TOPOLOGY: linear 
61 (ii) MOLECULE TYPE: peptide 
63 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
65 Met Lys Arg Leu Gly Ala Val Val Leu Ala Leu Val Leu Val Gly 
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68 Leu Leu Ala Gly Thr Ala Leu Ala Ala Pro Val Lys Pro Val Val 



5! Arg Asn Asn Ala Val Gin Gin Lys Asn Tyr Gly Leu Leu Thr Pro 
74 Gly Leu Phe Lys Lys Val Gin Arg Met Asn Trp Asn Gin Glu Val 
" Asp Thr Val lie Met Phe Gly Ser Tyr Gly Asp Arg Asp Arg Ala 



20 25 30 



80 Val Lys Val Leu Arg Leu Met Gly Ala Gin Val Lys Tyr Ser Tyr 

81 80 85 

83 Lys lie lie Pro Ala Val Ala Val Lys lie Lys Ala Arg Asp Leu 

Thr 

81 



65 



70 



8 6 Leu Leu He Ala Gly Met lie Asp Thr Gly Tyr Phe Gly Asn Thr 

si no 115 , 

89 Arg Val Ser Gly He Lys Phe lie Gin Glu Asp Tyr Lys Val Gin 



92 Val Asp Asp Ala Thr Ser Val Ser Gin lie Gly Ala Asp Thr Val 

95 Tr P Asn Ser Leu Gly Tyr Asp Gly Ser Gly Val Val Val Ala lie 

155 

98 Val Asp Thr Gly He Asp Ala Asn His Pro Asp Leu Lys Gly Lys 

101 Val lie Gly Trp'Jyr Asp Ala Val Asn Gly Arg Ser Thr Pro Tyr 

185 l y u 

r04 Asp Asp Gin Gly His Gly Thr His Val Ala Gly He Val Ala Gly 

111 Thr Gly Ser Val Asn Ser Gin Tyr He Gly Val Ala Pro Gly Ala 
108 215 220 



125 



U0 Lys Leu Val Gly Val Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 

230 

He Ala Gly Val Asp Trp 
245 250 



111 mv . „ T i Q zn=> nv Val Asd Trp Val Val Gin Asn Lys 

113 Val Ser Thr He He Ala Giy vaJ. asp ij-f 255 



111 230 235 

113 

H6 Asp Lys Tyr Gly lie Arg Val He Asn Leu Ser Leu Gly Ser Ser 
III Gin Ser Ser Asp Gly Thr Asp Ser Leu Ser Gin Ala Val Asn Asn 
122 Ala Trp Asp Ala Gly He Val Val Cys Val Ala Ala Gly Asn Ser 

125 Gly Pro Asn Thr Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys 

i o a 305 j 1U 

128 Val lie Thr Val Gly Ala Val Asp Ser Asn Asp Asn He Ala Ser 

129 320 325 

Gly 
335 



131 Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu 

o 335 

135 Val Val Ala Pro Gly Val Asp He He Ala Pro Arg Ala Ser Gly 



350 355 
138 Thr Ser Met Gly Thr Pro He Asn Asp Tyr Tyr Thr Lys Ala Ser 
139 



365 370 



file://C:\Crf3\Outhold\VsrI841553.htm 



6/14/01 



Page 3 of 7 



RAW SEQUENCE LISTING DATE: 06/14/2001 

PATENT APPLICATION: US/09/841,553 TIME: 12:35:12 



Input Set : A:\09841553.txt 

Output Set: N:\CRF3\06142001\I841553.raw 



1 A 1 


\Dxy 


i nr 






Ala 


Thr 


Pro 


His 


Val 


Ser 


Gly Val 


Gly Ala 


Leu 


1 A O 
1 H £ 








380 










385 










390 


1 A A 
14 4 


i le 


Leu 


r 1 n 
oXIl 




His 


Pro 


Ser 


Trp 


Thr 


Pro 


Asp 


Lys 


Val 


Lys 


Thr 


14 D 










395 










400 










405 


"\ A H 
14 / 


Aid 


Leu 


Tip 

lie 




Thr 


Ala 


Asp 


He 


Val 


Ala 


Pro 


Lys 


Glu 


He 


Ala 


1 A O 

14 o 










410 










415 










420 


150 


Asp 


lie 


Aia 


Tyr 


Gly Ala 


Gly Arg 


Val 


Asn 


Val 


T vr 


Lys 


Ala 


He 


151 






425 










430 










435 


153 


Lys 


Tyr 


Asp 


Asp 


Tyr 


Ala 


Lys 


Leu 


Thr 


Phe 


Alii. 


Gly 


Ser 


Val 


Ala 


154 






440 










445 










450 


156 


Asp 


Lys 


Gly 


Ser 


Ala 


Thr 


His 


Thr 


Phe 


Asp 


Val 


Ser 


Gly Ala 


Thr 


lb / 




455 










4 60 










465 


159 


Phe 


Val 


Thr 


Ala 


Thr 


Leu 


Tyr 


Trp 


Asp 


Thr 


vj-Ly 


C p, v- 




Asp 


He 


160 










470 










475 










480 


162 


Asp 


Leu 


Tyr 


Leu 


Tyr 


Asp 


Pro 


Asn 


Gly 


Asn 


Glu 


Val 


Asp 


Tyr 


Ser 


163 






485 










490 










495 


165 


Tyr 


Thr 


Ala 


Tyr 


Tyr 


Gly 


Phe 


Glu 


Lys 


Val 




T VT 


Tyr 


Asn 


Pro 


166 






500 










505 










510 


168 


Thr 


Ala 


Gly Thr 


Trp 


Thr 


Val 


Lys 


Val 


Val 


Ser 


Tvr 


Lys 


Gly Ala 


169 










515 










520 










525 


171 


Ala 


Asn 


Tyr 


Gin 


Val 


Asp 


V d -L 


Val 


Ser 


Asp 


C 1 v 
uiy 




Leu 


Ser 


Gin 


172 








530 










535 










540 


174 


Ser 


Gly Gly Gly 


Asn 


Pro 


Asn 


Pro 


Asn 


Pro 


Asn 


Pro 


Asn 


Pro 


Thr 


I/O 










545 










550 










555 


1 "7 *7 
1 / / 


Pro 


Thr 


Thr 


Asp 


Thr 


Gin 


Thr 


Phe 


Thr 


Gly 


Ser 


Val 


Asn 


Asp 


Tyr 


1 / O 








560 










565 










570 


1 ftO 
10U 


irp 


Asp 


Thr 


Ser 


Asp 


Thr 


Phe. 


Thr 


Met 


Asn 


Val 


Asn 


Ser 


Gly 


Ala 


1 ft 1 

J. O -L 






575 










580 










585 


183 


Thr 


Lys 


He 


Thr 


Gly Asp 


Leu 


Thr 


Phe 


Asp 


Thr 


Ser 


Tyr 


Asn 


Asp 


184 








590 










595 










600 


186 


Leu 


Asp 


Leu 


Tyr 


Leu 


Tyr 


Asp 


Pro 


Asn 


Gly 


Asn 


Leu 


Val 


Asp Arg 


187 








605 










610 










615 


189 


Ser 


Thr 


Ser 


Ser 


Asn 


Ser 


Tyr 


Glu 


His 


Val 


Glu 


Tyr 


Ala 


Asn 


Pro 


190 










620 










625 










630 


192 


Ala 


Pro 


Gly 


Thr 


Trp 


Thr 


Phe 


Leu 


Val 


Tyr 


Ala 


Tyr 


Ser 


Thr 


Tyr 


193 








635 










640 








Gly 


645 


195 


Gly 


Trp 


Ala 


Asp 


Tyr 


Gin 


Leu 


Lys 


Ala 


Val 


Val 


Tyr 


Tyr 




196 






650 










655 












199 


(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 


2: 















201 (i) SEQUENCE CHARACTERISTICS: 

202 (A) LENGTH: 1977 base pairs 

203 (B) TYPE: nucleic acid 

204 (C) STRANDEDNESS: double 

205 ( D) TOPOLOGY: linear 

207 (ii) MOLECULE TYPE: genomic DNA 

209 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

211 ATGAAGAGGT TAGGTGCTGT GGTGCTGGCA CTGGTGCTCG TGGGTCTTCT GGCCGGAACG 60 
213 GCCCTTGCGG CACCCGTAAA ACCGGTTGTC AGGAACAACG CGGTTCAGCA GAAGAACTAC 120 
215 GGACTGCTGA CCCCGGGACT GTTCAAGAAA GTCCAGAGGA TGAACTGGAA CCAGGAAGTG 180 
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217 GACACCGTCA TAATGTTCGG GAGCTACGGA GACAGGGACA GGGCGGTTAA GGTACTGAGG 240 

219 CTCATGGGCG CCCAGGTCAA GTACTCCTAC AAGATAATCC CTGCTGTCGC GGTTAAAATA 300 

221 AAGGCCAGGG ACCTTCTGCT GATCGCGGGC ATGATAGACA CGGGTTACTT CGGTAACACA 360 

223 AGGGTCTCGG GCATAAAGTT CATACAGGAG GATTACAAGG TTCAGGTTGA CGACGCCACT 4 20 

225 TCCGTCTCCC AGATAGGGGC CGATACCGTC TGGAACTCCC TCGGCTACGA CGGAAGCGGT 4 80 

227 GTGGTGGTTG CCATCGTCGA TACGGGTATA GACGCGAACC ACCCCGATCT GAAGGGCAAG 540 

229 GTCATAGGCT GGTACGACGC CGTCAACGGC AGGTCGACCC CCTACGATGA CCAGGGACAC 600 

231 GGAACCCACG TTGCGGGTAT CGTTGCCGGA ACCGGCAGCG TTAACTCCCA GTACATAGGC 660 

233 GTCGCCCCCG GCGCGAAGCT CGTCGGCGTC AAGGTTCTCG GTGCCGACGG TTCGGGAAGC 720 

235 GTCTCCACCA TCATCGCGGG TGTTGACTGG GTCGTCCAGA ACAAGGACAA GTACGGGATA 780 

237 AGGGTCATCA ACCTCTCCCT CGGCTCCTCC CAGAGCTCCG ACGGAACCGA CTCCCTCAGT 840 

239 CAGGCCGTCA ACAACGCCTG GGACGCCGGT ATAGTAGTCT GCGTCGCCGC CGGCAACAGC 900 

241 GGGCCGAACA CCTACACCGT CGGCTCACCC GCCGCCGCGA GCAAGGTCAT AACCGTCGGT 960 

24 3 GCAGTTGACA GCAACGACAA CATCGCCAGC TTCTCCAGCA GGGGACCGAC CGCGGACGGA 1020 

245 AGGCTCAAGC CGGAAGTCGT CGCCCCCGGC GTTGACATCA TAGCCCCGCG CGCCAGCGGA 1080 

247 ACCAGCATGG GCACCCCGAT AAACGACTAC TACACCAAGG CCTCTGGAAC CAGCATGGCC 1140 

249 ACCCCGCACG TTTCGGGCGT TGGCGCGCTC ATCCTCCAGG CCCACCCGAG CTGGACCCCG 1200 

251 GACAAGGTGA AGACCGCCCT CATCGAGACC GCCGACATAG TCGCCCCCAA GGAGATAGCG 1260 

253 GACATCGCCT ACGGTGCGGG TAGGGTGAAC GTCTACAAGG CCATCAAGTA CGACGACTAC 1320 

255 GCCAAGCTCA CCTTCACCGG CTCCGTCGCC GACAAGGGAA GCGCCACCCA CACCTTCGAC 1380 

257 GTCAGCGGCG CCACCTTCGT GACCGCCACC CTCTACTGGG ACACGGGCTC GAGCGACATC 1440 

259 GACCTCTACC TCTACGACCC CAACGGGAAC GAGGTTGACT ACTCCTACAC CGCCTACTAC 1500 

261 GGCTTCGAGA AGGTCGGCTA CTACAACCCG ACCGCCGGAA CCTGGACGGT CAAGGTCGTC 1560 

263 AGCTACAAGG GCGCGGCGAA CTACCAGGTC GACGTCGTCA GCGACGGGAG CCTCAGCCAG 1620 

265 TCCGGCGGCG GCAACCCGAA TCCAAACCCC AACCCGAACC CAACCCCGAC CACCGACACC 1680 

2 67 CAGACCTTCA CCGGTTCCGT TAACGACTAC TGGGACACCA GCGACACCTT CACCATGAAC 17 4 0 

269 GTCAACAGCG GTGCCACCAA GATAACCGGT GACCTGACCT TCGATACTTC CTACAACGAC 1800 

271 CTCGACCTCT ACCTCTACGA CCCCAACGGC AACCTCGTTG ACAGGTCCAC GTCGAGCAAC 1860 

273 AGCTACGAGC ACGTCGAGTA CGCCAACCCC GCCCCGGGAA CCTGGACGTT CCTCGTCTAC 1920 

275 GCCTACAGCA CCTACGGCTG GGCGGACTAC CAGCTCAAGG CCGTCGTCTA CTACGGG 1977 
278 (2) INFORMATION FOR SEQ ID NO: 3: 

280 (i) SEQUENCE CHARACTERISTICS: 

281 (A) LENGTH: 522 amino acids 

282 (B) TYPE: amino acid 

283 (C) STRANDEDNESS : single 

284 (D) TOPOLOGY: linear 
286 (ii) MOLECULE TYPE: peptide 

C~> 288 (ix) FEATURE: 

w > 289 OTHER INFORMATION: 

W — > 290 Xaa at position 428 is Gly or Val . 

291 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

293 Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala 

294 5 10 15 

296 Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly He Thr He 

297 20 25 30 



299 Gly lie lie Asp Thr Gly He Asp Ala Ser His Pro Asp Leu Gin 

300 35 

302 Gly Lys Val He Gly 

303 50 



300 35 40 45 

302 Gly Lys Val He Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr 

55 60 
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305 Pro Tyr Asp Asp His Gly His Gly Thr His Val Ala Ser He Ala 

306 65 70 75 

308 Ala Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala 

309 80 85 90 

311 Pro Gly Ala Lys Leu Ala Gly He Lys Val Leu Gly Ala Asp Gly 

312 95 100 105 

314 Ser Gly Ser He Ser Thr lie lie Lys Gly Val Glu Trp Ala Val 

315 110 115 12° 

317 Asp Asn Lys Asp Lys Tyr Gly lie Lys Val lie Asn Leu Ser Leu 

318 125 130 135 

320 Glv Ser Ser Gin Ser Ser Asp Gly Thr Asp Ala Leu Ser Gin Ala 

321 140 145 150 

323 Val Asn Ala Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala 

324 155 160 165 

326 Gly Asn Ser Gly Pro Asn Lys Tyr Thr lie Gly Ser Pro Ala Ala 

327 170 175 180 

329 Ala Ser Lys Val lie Thr Val Gly Ala Val Asp Lys Tyr Asp Val 

330 185 190 195 

332 He Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu 

333 200 205 210 

336 Lys Pro Glu Val Val Ala Pro Gly Asn Trp He He Ala Ala Arg 

337 215 220 225 

339 Ala Ser Gly Thr Ser Met Gly Gin Pro He Asn Asp Tyr Tyr Thr 

340 230 235 240 

342 Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly He 

343 245 250 255 

345 Ala Ala Leu Leu Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys 

346 260 265 270 

348 Val Lys Thr Ala Leu He Glu Thr Ala Asp He Val Lys Pro Asp 

349 275 280 285 

351 Glu He Ala Asp He Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr 

352 290 295 300 

354 Lys Ala He Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly 

355 305 . 310 315 

357 Tyr Val Ala Asn Lys Gly Ser Gin Thr His Gin Phe Val He Ser 

358 320 325 330 

360 Gly Ala Ser Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn 

361 335 340 345 

363 Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gin Val 

364 350 355 360 

366 Asp Tyr Ser Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr 

367 365 370 -3/ 5 

369 Tyr Asn Pro Thr Asp Gly Thr Trp Thr He Lys Val Val Ser Tyr 

370 380 385 390 

372 Ser Gly Ser Ala Asn Tyr Gin Val Asp Val Val Ser Asp Gly Ser 

373 395 ■ 400 405 

375 Leu Ser Gin Pro Gly Ser Ser Pro Ser Pro Gin Pro Glu Pro Thr 

376 410 415 420 
W— > 378 Val Asp Ala Lys Thr Phe Gin Xaa Ser Asp His Tyr Tyr Tyr Asp 
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PATENT APPLICATION: US/09/841,553 



DATE: 06/14/2001 
TIME: 12:35:13 



Input Set : A:\09841553.txt 
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L:32 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER: ]" 
L:33 M:220 C: Keyword misspelled or invalid format, [(B) FILING* DATE : ] 
L:35 M:220 C: Keyword misspelled or invalid format, [ (vii) PRIOR APPLICATION DATA:] 
L:39 M:220 C: Keyword misspelled or invalid format, [(vii) PRIOR APPLICATION DATA:] 
L:288 M:220 C: Keyword misspelled or invalid format, [ (ix) FEATURE : ] 

L:289 M:246 W: Invalid value of Alpha Sequence Header Field, Data=[Xaa at position 428 is Gly 
or Val. ] 

L:291 M:246 W: Invalid value of Alpha Sequence Header Field, [FEATURE:], SeqNo=3 
L:378 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3 

L:411 M:220 C: Keyword misspelled or invalid format, [ (ix) FEATURE : ] 

L:412 M:246 W: Invalid value of Alpha Sequence Header Field, Data=[N at position 1283 is G or 
T.] 

L:414 M:246 W: Invalid value of Alpha Sequence Header Field, [FEATURE:], SeqNo=4 
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